Multi-channel conversion path position reporting

ABSTRACT

Methods, systems, and apparatuses, including computer programs encoded on computer-readable media, for receiving information regarding a plurality of conversion paths. Each conversion path includes one or more user interactions, corresponds to a user, and ends with a conversion interaction. A list of channel groups is received. Each channel group includes a name and one or more group rules that refer to the characteristics of one or more user interactions of the plurality of conversion paths. Each user interaction in the plurality of conversion paths is grouped based upon the list of channel groups. A selection of channels to visualize is received and for each of the selected channels to visualize the user interactions are aggregated based upon the channel group of the user interaction and a position of the user interaction within the conversion path. Information regarding the aggregated user interactions for each of the selected channels is provided.

BACKGROUND

The internet provides access to a wide variety of content. For instance, images, audio, video, and web pages for a myriad of different topics are accessible through the Internet. The accessible content provides an opportunity to place advertisements. Advertisements can be placed within content, such as a web page, image or video, or the content can trigger the display of one or more advertisements, such as presenting an advertisement in an advertisement slot.

Advertisers decide which ads are displayed within particular content using various advertising management tools. These tools also allow an advertiser to track the performance of various ads or ad campaigns. The parameters used to determine when to display a particular ad can also be changed using advertising management tools.

A variety of available data may be used to generate the performance measures. This data usually includes a combination of data from multiple servers. The amount of the combined data is large enough that performance measures generated from the data can be used to provide an efficient way of understanding the data. Processing of the data to generate useful and accurate performance measures involves a number of obstacles. For instance, if a performance measure is based upon a user's actions over a period of time, the user's actions should be tracked. A cookie can be used to track a user's actions over a period of time. However, if this cookie is removed during the period of time, collection of accurate data tracking the user's actions may be disrupted. The data can contain user actions that include various actions that are significant to an advertiser, which are called conversions. Identifying other actions that contribute to the occurrence of conversions is valuable. The data, however, contains numerous actions that could be associated with conversions. In addition, the data may also contain information regarding user actions that do not contribute to any recorded conversions. Thus, processing the data to provide accurate and reliable performance measures based upon the available information regarding user actions has a number of challenges.

SUMMARY

In general, one innovative aspect of the subject matter described in this specification can be embodied in methods for providing data related to conversion paths that includes receiving information regarding a plurality of conversion paths. Each conversion path includes one or more user interactions, corresponds to a user, and ends with a conversion interaction. A user interaction includes characteristics that describe the user interaction. A list of channel groups is received. Each channel group includes a name and one or more group rules that refer to the characteristics of one or more user interactions of the plurality of conversion paths. Each user interaction in the plurality of conversion paths is grouped based upon the list of channel groups. A selection of channels to visualize is received and for each of the selected channels to visualize the user interactions are aggregated based upon the channel group of the user interaction and a position of the user interaction within the conversion path. Information regarding the aggregated user interactions for each of the selected channels is provided. Other embodiments of this aspect include corresponding systems, apparatuses, and computer-readable media, configured to perform the actions of the method.

BRIEF DESCRIPTION OF THE DRAWINGS

The details of one or more embodiments of the subject matter described in this specification are set forth in the accompanying drawings and the description below. Other features, aspects, and advantages of the subject matter will become apparent from the description, the drawings, and the claims.

FIG. 1 is a block diagram of an example environment in which an advertisement management system manages advertising services in accordance with an illustrative embodiment.

FIG. 2 is a flow diagram of a process for integrating user interaction log data in accordance with an illustrative embodiment.

FIG. 3 is a block diagram that illustrates user interaction data being updated during a user interaction log data integration process in accordance with an illustrative embodiment.

FIG. 4 is a block diagram that illustrates data associated with user interactions in accordance with an illustrative embodiment.

FIG. 5 is a block diagram that illustrates conversion paths in accordance with an illustrative embodiment.

FIG. 6 is a flow diagram of a process for providing information related to one or more channels regarding aggregated user interactions in accordance with an illustrative embodiment.

FIGS. 7A-7D illustrate building a graphical representation of user interactions based upon the user interactions' order in a conversion path for different channels in accordance with an illustrative embodiment.

FIGS. 8A-8D illustrate applying a curve to FIGS. 7A-7D in accordance with an illustrative embodiment.

FIGS. 9A-9E illustrate a graphical representation of user interactions for different channels in accordance with another illustrative embodiment.

FIG. 10 illustrates a sample user interface to display graphs of various channels in accordance with an illustrative embodiment.

FIG. 11 is a block diagram of a computer system in accordance with an illustrative embodiment.

Like reference numbers and designations in the various drawings indicate like elements.

DETAILED DESCRIPTION

Content providers (e.g., advertisers) are provided various reports that disclose various user interactions with content. These reports can illustrate how users are interacting with content. Users can interact with content of various types. Different types of content can be classified into different channels, such as from email, organic search results, paid search results, social network sites, direct browser interactions, etc. Conversions are a type of user interaction that relates to a particular user interaction selected by advertisers. Prior to a conversion, any number of user interactions may occur. These user interactions, along with the conversion, are a conversion path. Reports can be generated that provide an overview of where in time and/or where in position user interactions occur in numerous conversion paths related to a particular channel.

As used throughout this document, user interactions include any presentation of content to a user and any subsequent affirmative actions or non-actions (collectively referred to as “actions” unless otherwise specified) that a user takes in response to presentation of content to the user (e.g., selections of the content following presentation of the content, or no selections of the content following the presentation of the content). Thus, a user interaction does not necessarily require a selection of the content (or any other affirmative action) by the user.

User interaction measures can include one or more of time lag measures (i.e., measures of time from one or more specified user interactions to a conversion), path length measures (i.e., quantities of user interactions that occurred prior to conversions), user interaction paths (i.e., sequences of user interactions that occurred prior to the conversion), assist interaction measures (i.e., quantities of particular user interactions that occurred prior to the conversion), and assisted conversion measures (i.e., quantities of conversions that were assisted by specified content).

FIG. 1 is a block diagram of an example environment in which an advertisement management system manages advertising services in accordance with an illustrative embodiment. The example environment 100 includes a network 102, such as a local area network (LAN), a wide area network (WAN), the Internet, or a combination thereof. The network 102 connects websites 104, user devices 106, advertisers 108, and an advertisement management system 110. The example environment 100 may include many thousands of websites 104, user devices 106, and advertisers 108.

A website 104 includes one or more resources 105 associated with a domain name and hosted by one or more servers. An example website is a collection of web pages formatted in hypertext markup language (HTML) that can contain text, images, multimedia content, and programming elements, such as scripts.

A resource 105 is any data that can be provided over the network 102. A resource 105 is identified by a resource address that is associated with the resource 105, such as a uniform resource locator (URL). Resources 105 can include web pages, word processing documents, portable document format (PDF) documents, images, video, programming elements, interactive content, and feed sources, to name only a few. The resources 105 can include content, such as words, phrases, images and sounds, that may include embedded information (such as meta-information in hyperlinks) and/or embedded instructions. Embedded instructions can include code that is executed at a user's device, such as in a web browser. Code can be written in languages such as JavaScript® or ECMAScript®.

A user device 106 is an electronic device that is under control of a user and is capable of requesting and receiving resources 105 over the network 102. Example user devices 106 include personal computers, mobile communication devices, and other devices that can send and receive data over the network 102. A user device 106 typically includes a user application, such as a web browser, to facilitate the sending and receiving of data over the network 102.

A user device 106 can request resources 105 from a website 104. In turn, data representing the resource 105 can be provided to the user device 106 for presentation by the user device 106. The data representing the resource 105 can include data specifying a portion of the resource or a portion of a user display (e.g., a presentation location of a pop-up window or in a slot of a web page) in which advertisements can be presented. These specified portions of the resource 105 or user display are referred to as advertisement slots.

To facilitate searching of the vast number of resources 105 accessible over the network 102, the environment 100 can include a search system 112 that identifies the resources 105 by crawling and indexing the resources 105 provided on the websites 104. Data about the resources 105 can be indexed based on the resource 105 with which the data is associated. The indexed and, optionally, cached copies of the resources 105 are stored in a search index (not shown).

User devices 106 can submit search queries to the search system 112 over the network 102. In response, the search system 112 accesses the search index to identify resources 105 that are relevant to the search query. In one illustrative embodiment, a search query includes one or more keywords. The search system 112 identifies the resources 105 that are responsive to the query, provides information about the resources 105 in the form of search results and returns the search results to the user devices 106 in search results pages. A search result can include data generated by the search system 112 that identifies a resource 105 that is responsive to a particular search query, and can include a link to the resource 105. An example search result can include a web page title, a snippet of text or a portion of an image extracted from the web page 104, a rendering of the resource 105, and the URL of the web page 104. Search results pages can also include one or more advertisement slots in which advertisements can be presented.

A search result page can be sent with a request from the search system 112 for the web browser of the user device 106 to set an HTTP (HyperText Transfer Protocol) cookie. A cookie can represent, for example, a particular user device 106 and a particular web browser. For example, the search system 112 includes a server that replies to the query by sending the search results page in an HTTP response. This HTTP response includes instructions (e.g., a set cookie instruction) that cause the browser to store a cookie for the site hosted by the server or for the domain of the server. If the browser supports cookies and cookies are enabled, every subsequent page request to the same server or a server within the domain of the server will include the cookie. The cookie can store a variety of data, including a unique or semi-unique identifier. The unique or semi-unique identifier can be anonymized and is not connected with user names. Because HTTP is a stateless protocol, the use of cookies allows an external service, such as the search system 112 or other system, to track particular actions and status of a user over multiple sessions. A user may opt out of tracking user actions, for example, by disabling cookies in the browser's settings.

When a resource 105 or search results are requested by a user device 106 or provided to the user device 106, the advertisement management system 110 receives a request for advertisements to be provided with the resource 105 or search results. The request for advertisements can include characteristics of the advertisement slots that are defined for the requested resource 105 or search results page, and can be provided to the advertisement management system 110. For example, a reference (e.g., URL) to the resource 105 for which the advertisement slot is defined, a size of the advertisement slot, and/or media types that are available for presentation in the advertisement slot can be provided to the advertisement management system 110. Similarly, keywords (i.e., one or more words that are associated with content) associated with a requested resource 105 (“resource keywords”) or a search query for which search results are requested can also be provided to the advertisement management system 110 to facilitate identification of advertisements that are relevant to the resource 105 or search query.

Based on data included in the request for advertisements, the advertisement management system 110 can select advertisements that are eligible to be provided in response to the request (“eligible advertisements”). For example, eligible advertisements can include advertisements having characteristics matching the characteristics of advertisement slots and that are identified as relevant to specified resource keywords or search queries. In some implementations, advertisements having targeting keywords that match the resource keywords, the search query, or portions of the search query are selected as eligible advertisements by the advertisement management system 110.

The advertisement management system 110 selects an eligible advertisement for each advertisement slot of a resource 105 or of a search results page. The resource 105 or search results page is received by the user device 106 for presentation by the user device 106. User interaction data representing user interactions with presented advertisements can be stored in a historical data store 119. For example, when an advertisement is presented to the user via an ad server 114, data can be stored in a log file 116. This log file 116, as more fully described below, can be aggregated with other data in the historical data store 119. Accordingly, the historical data store 119 contains data representing the advertisement impression. For example, the presentation of an advertisement is stored in response to a request for the advertisement that is presented. For example, the ad request can include data identifying a particular cookie, such that data identifying the cookie can be stored in association with data that identifies the advertisement(s) that were presented in response to the request. In some implementations, the data can be stored directly to the historical data store 119.

Similarly, when a user selects (i.e., clicks) a presented advertisement, data representing the selection of the advertisement can be stored in the log file 116, a cookie, or the historical data store 119. In some implementations, the data is stored in response to a request for a web page that is linked to by the advertisement. For example, the user selection of the advertisement can initiate a request for presentation of a web page that is provided by (or for) the advertiser. The request can include data identifying the particular cookie for the user device, and this data can be stored in the advertisement data store.

User interaction data can be associated with unique identifiers that represent a corresponding user device with which the user interactions were performed. For example, in some implementations, user interaction data can be associated with one or more cookies. Each cookie can include content which specifies an initialization time that indicates a time at which the cookie was initially set on the particular user device 106.

The log files 116, or the historical data store 119, also store references to advertisements and data representing conditions under which each advertisement was selected for presentation to a user. For example, the historical data store 119 can store targeting keywords, bids, and other criteria with which eligible advertisements are selected for presentation. Additionally, the historical data store 119 can include data that specifies a number of impressions for each advertisement and the number of impressions for each advertisement can be tracked, for example, using the keywords that caused the advertisement impressions and/or the cookies that are associated with the impressions. Data for each impression can also be stored so that each impression and user selection can be associated with (i.e., stored with references to and/or indexed according to) the advertisement that was selected and/or the targeting keyword that caused the advertisement to be selected for presentation.

The advertisers 108 can submit, to the advertisement management system 110, campaign parameters (e.g., targeting keywords and corresponding bids) that are used to control distribution of advertisements. The advertisers 108 can access the advertisement management system 110 to monitor performance of the advertisements that are distributed using the campaign parameters. For example, an advertiser can access a campaign performance report that provides a number of impressions (i.e., presentations), selections (i.e., clicks), and conversions that have been identified for the advertisements. The campaign performance report can also provide a total cost, a cost-per-click, and other cost measures for the advertisement over a specified period of time. For example, an advertiser may access a performance report that specifies that advertisements distributed using the phrase match keyword “hockey” have received 1,000 impressions (i.e., have been presented 1,000 times), have been selected (e.g., clicked) 20 times, and have been credited with 5 conversions. Thus, the phrase match keyword hockey can be attributed with 1,000 impressions, 20 clicks, and 5 conversions.

As described above, reports that are provided to a particular content provider can specify performance measures measuring user interactions with content that occur prior to a conversion. A conversion occurs when a user performs a specified action, and a conversion path includes a conversion and a set of user interactions occurring prior to the conversion by the user. Any user interaction or user interactions can be deemed a conversion. What constitutes a conversion may vary from case to case and can be determined in a variety of ways. For example, a conversion may occur when a user clicks on an advertisement, is referred to a web page or website, and then consummates a purchase there before leaving the web page or website. As another example, a conversion may occur when a user spends more than a given amount of time on a particular website. Data from multiple user interactions can be used to determine the amount of time at the particular website.

Actions that constitute a conversion can be specified by each advertiser. For example, each advertiser can select, as a conversion, one or more measurable/observable user actions such as, for example, downloading a white paper, navigating to at least a given depth of a website, viewing at least a certain number of web pages, spending at least a predetermined amount of time on a website or web page, or registering on a website. Other actions that constitute a conversion can also be used.

To track conversions (and other interactions with an advertiser's website), an advertiser can include, in the advertiser's web pages, embedded instructions that monitor user interactions (e.g., page selections, content item selections, and other interactions) with advertiser's website, and can detect a user interaction (or series of user interactions) that constitutes a conversion. In some implementations, when a user accesses a web page, or another resource, from a referring web page (or other resource), the referring web page (or other resource) for that interaction can be identified, for example, by execution of a snippet of code that is referenced by the web page that is being accessed and/or based on a URL that is used to access the web page.

For example, a user can access an advertiser's website by selecting a link presented on a web page, for example, as part of a promotional offer by an affiliate of the advertiser. This link can be associated with a URL that includes data (i.e., text) that uniquely identifies the resource from which the user is navigating. For example, the link http://www.example.com/homepage/%affiliate_identifier%promotion_(—)1 specifies that the user navigated to the example.com web page from a web page of the affiliate that is associated with the affiliate identifier number that is specified in the URL, and that the user was directed to the example.com web page based on a selection of the link that is included in the promotional offer that is associated with promotion_(—)1. The user interaction data for this interaction (i.e., the selection of the link) can be stored in a database and used, as described below, to facilitate performance reporting.

When a conversion is detected for an advertiser, conversion data representing the conversion can be transmitted to a data processing apparatus (“analytics apparatus”) that receives the conversion data, and in turn, stores the conversion data in a data store. This conversion data can be stored in association with one or more cookies for the user device that was used to perform the user interaction, such that user interaction data associated with the cookies can be associated with the conversion and used to generate a performance report for the conversion.

Typically, a conversion is attributed to a targeting keyword when an advertisement that is targeted using the targeted keyword is the last clicked advertisement prior to the conversion. For example, advertiser X may associate the keywords “tennis,” “shoes,” and “Brand-X” with advertisements. In this example, assume that a user submits a first search query for “tennis,” the user is presented a search result page that includes advertiser X's advertisement, and the user selects the advertisement, but the user does not take an action that constitutes a conversion. Assume further that the user subsequently submits a second search query for “Brand-X,” is presented with the advertiser X's advertisement, the user selects advertiser X's advertisement, and the user takes action that constitutes a conversion (e.g., the user purchases Brand-X tennis shoes). In this example, the keyword “Brand-X” will be credited with the conversion because the last advertisement selected prior to the conversion (“last selected advertisement”) was an advertisement that was presented in response to the “Brand-X” being matched.

Providing conversion credit to the keyword that caused presentation of the last selected advertisement (“last selection credit”) prior to a conversion is a useful measure of advertisement performance, but this measure alone does not provide advertisers with data that facilitates analysis of a conversion cycle that includes user exposure to, and/or selection of, advertisements prior to the last selected advertisement. For example, last selection credit measures alone do not specify keywords that may have increased brand or product awareness through presentation of advertisements that were presented to, and/or selected by, users prior to selection of the last selected advertisement. However, these advertisements may have contributed significantly to the user subsequently taking action that constituted a conversion.

In the example above, the keyword “tennis” is not provided any credit for the conversion, even though the advertisement that was presented in response to a search query matching the keyword “tennis” may have contributed to the user taking an action that constituted a conversion (e.g., making a purchase of Brand-X tennis shoes). For instance, upon user selection of the advertisement that was presented in response to the keyword “tennis” being matched, the user may have viewed Brand-X tennis shoes that were available from advertiser X. Based on the user's exposure to the Brand-X tennis shoes, the user may have subsequently submitted the search query “Brand-X” to find the tennis shoes from Brand-X. Similarly, the user's exposure to the advertisement that was targeted using the keyword “tennis,” irrespective of the user's selection of the advertisement, may have also contributed to the user subsequently taking action that constituted a conversion (e.g., purchasing a product from advertiser X). Analysis of user interactions, with an advertiser's advertisements (or other content), that occur prior to selection of the last selected advertisement can enhance an advertiser's ability to understand the advertiser's conversion cycle.

A conversion cycle is a period that begins when a user is presented an advertisement and ends at a time at which the user takes action that constitutes a conversion. A conversion cycle can be measured and/or constrained by time or actions and can span multiple user sessions. User sessions are sets of user interactions that are grouped together for analysis. Each user session includes data representing user interactions that were performed by a particular user and within a session window (i.e., a specified period). The session window can be, for example, a specified period of time (e.g., 1 hour, 1 day, or 1 month) or can be delineated using specified actions. For example, a user search session can include user search queries and subsequent actions that occur over a 1 hour period and/or occur prior to a session ending event (e.g., closing of a search browser).

Analysis of a conversion cycle can enhance an advertiser's ability to understand how its customers interact with advertisements over a conversion cycle. For example, if an advertiser determines that, on average, an amount of time from a user's first exposure to an advertisement to a conversion is 20 days, the advertiser can use this data to infer an amount of time that users spend researching alternative sources prior to converting (i.e., taking actions that constitute a conversion). Similarly, if an advertiser determines that many of the users that convert do so after presentation of advertisements that are targeted using a particular keyword, the advertiser may want to increase the amount of money that it spends on advertisements distributed using that keyword and/or increase the quality of advertisements that are targeted using that particular keyword.

Measures of user interactions that facilitate analysis of a conversion cycle are referred to as conversion path performance measures. A conversion path is a set of user interactions by a particular user prior to and including a conversion by the particular user. Conversion path performance measures specify durations of conversion cycles, numbers of user interactions that occurred during conversion cycles, paths of user interactions that preceded a conversion, numbers of particular user interactions that occurred preceding conversions, as well as other measures of user interaction that occurred during conversion cycles, as described in more detail below.

The advertisement management system 110 includes a performance analysis apparatus 120 that determines conversion path performance measures that specify measures of user interactions with content items during conversion cycles. The performance analysis apparatus 120 tracks, for each advertiser, user interactions with advertisements that are provided by the advertiser, determines (i.e., computes) one or more conversion path performance measures, and provides data that cause presentation of a performance report specifying at least one of the conversion path performance measures. Using the performance report, the advertiser can analyze its conversion cycle, and learn how each of its keywords cause presentation of advertisements that facilitate conversions, irrespective of whether the keywords caused presentation of the last selected advertisement. In turn, the advertiser can adjust campaign parameters that control distribution of its advertisements based on the performance report.

Configuration options can be offered to reduce bias in performance reports. Without configuration options, some performance reports can be biased, such as towards short conversion paths. For example, a performance report can be biased towards short conversion paths if data used as a basis for the report includes a percentage of partial conversion paths which is higher than a threshold percentage. A partial conversion path is a conversion path in which some but not all user interaction data for a user is associated with a conversion. A partial conversion path can be included in a report if, for example, the report is generated using a reporting period which is less then the length of a typical conversion cycle for the advertiser who requested the report.

A reporting period determines the maximum length (in days) of a reported conversion cycle because additional data outside of the reporting period is not used to generate the report. A performance report can be based on a reporting period (i.e., lookback window), such that user interactions prior to the reporting period are not considered part of the conversion cycle when generating the report. Such a reporting period is referred to as a “lookback window”. For example, when generating a report with a lookback window of thirty days, available user interaction data representing user actions that occurred between July 1 and July 31 of a given year would be available for a conversion that occurred on July 31 of that year.

If a default lookback window (e.g., thirty days) is used, the performance report can be biased towards short conversion paths if the typical conversion cycle length for a product associated with the report is greater than the default lookback window. For instance, in the example above, a typical conversion cycle for “Brand-X” tennis shoes may be relatively short (e.g., thirty days) as compared to a conversion cycle for a more expensive product, such as a new car. A new car may have a much longer conversion cycle (e.g., ninety days).

Different advertisers or different products for an advertiser can have different associated conversion cycle lengths. For example, an advertiser that sells low cost (e.g., less than $100) products may specify a lookback window of 30 days, while an advertiser that sells more expensive products (e.g., at least $1000) may specify a lookback window of 90 days.

In some implementations, an advertiser 108 can specify a lookback window to use when requesting a performance report, such as by entering a number of days or by selecting a lookback window from a list of specific lookback windows (e.g., thirty days, sixty days, ninety days). Allowing an advertiser to configure the lookback window of their performance reports enables the advertiser to choose a lookback window that corresponds to conversion cycles specific to their products. Allowing lookback window configuration also enables advertisers to experiment with different lookback windows, which can result in the discovery of ways to improve conversion rates.

Other factors can contribute to reporting on partial conversion paths. For example, as mentioned above, user interaction data used as a basis for a report can be associated with unique identifiers that represent a user device with which the user interactions were performed. As described above, a unique identifier can be stored as a cookie. Cookies can be deleted from user devices, such as by a user deleting cookies, a browser deleting cookies (e.g., upon browser exit, based on a browser preference setting), or some other software (e.g., anti-spyware software) deleting cookies.

If cookies are deleted from a user device, a new cookie will be set on the user's device when the user visits a web page (e.g., the search system 112). The new cookie may be used to store a new quasi-unique identifier, and thus subsequent user interaction data that occurs on the user device may be associated with a different identifier. Therefore, because each user identifier is considered to represent a different user, the user interaction data associated with the deleted cookies are identified as being associated with a different user than the user interaction data that is associated with the new cookies.

For instance, in the example above, assume that the user deletes cookies after the first search query for “tennis” is performed and that the second search query for “Brand-X” occurs after the cookies are deleted. In this example, performance measures computed based on the user interaction data for the user can show a bias. For example, a path length measure can be computed as one, rather than two, since the advertisement selection resulting from the first search query is not considered part of the same conversion cycle as the advertisement selection resulting from the second search query, since the two user interactions do not appear to have been performed by the same user.

To view a report which reduces bias caused from partial conversion paths, an advertiser can specify a lookback window for the report. As described above, the lookback window specifies that the user interaction data used to generate the report are user interaction data that are associated with unique identifiers that have initialization times that are prior to a specified period (e.g., thirty days, sixty days, ninety days) before the conversions. Thus, conversions for which user interaction data that are associated with unique identifiers having initialization times that are after the specified period are excluded from inclusion as a basis for the report. A unique identifier that has a recent initialization time indicates that the unique identifier may have been recently reinitialized on the user device that the unique identifier represents. Accordingly, user interaction data associated with the relatively new unique identifier may represent only a partial conversion path. Alternatively, conversions for which user interaction data that are associated with unique identifiers having initialization times that are after the specified period are included in the report. To reduce bias, any user interaction included in the conversion path that occurred after the specified period are removed from the conversion path prior to being included in the report.

FIG. 2 is a flow diagram of a process for integrating user interaction log data in accordance with an illustrative embodiment. The process 200 is a process that updates conversion paths and determines conversions based upon the updated conversion paths of users.

The process 200 can be implemented on the advertisement management system 110, the performance analysis apparatus 120, or another computing device. In one implementation, the process 200 is encoded on a computer-readable medium that contains instructions that when executed by a computing device cause the computing device to perform operations of process 200.

As described above, log files 116 may contain user interaction data. A log file 116 may be combined with user interaction data from other logs from other servers, including those that implement the search system 112, prior to processing. Processing starts with the computing device that implements the process 200 determining that a new log is available for processing (210). For example, a notification can be sent to the computing device indicating that a new log is ready for processing, or the existence of a new log can indicate that the new log is ready for processing.

Next, the new log is retrieved (220). The new log may be retrieved over the network 102. The stateful history for each user is updated based upon the user activity indicated by the new log. The new log can contain information relating to user interactions for numerous users. The historical data store 119 contains user interaction data from previously processed log files. The user interaction data contained within the historical data store 119 can be stateful, in that the user interaction data can be grouped by user identifier and ordered chronologically. FIG. 3 is a block diagram that illustrates user interaction data being updated during a user interaction log data integration process 200 in accordance with an illustrative embodiment. FIG. 3 illustrates four example user identifiers, although the historical data store 119 and log files 116 can contain data associated with thousands or millions of different user identifiers. In one embodiment, previously stored user interaction data 310 are stored in the historical data store 119. As illustrated, no user interaction data associated with user identifier 3 has been previously stored in the historical data store 119.

The new log can contain user interaction data for one or more user identifiers. The user interaction data can be grouped by user identifiers and then sorted chronologically (230). Column 320 illustrates grouped and sorted user interaction data. As illustrated, user identifier 2 does not include any new user interaction data, and user identifiers 1, 3, and 4 have updated user interaction data. For instance, the new log file includes user interaction data associated with user interactions a₁₃ and a₁₄ that are associated with user identifier 1. The grouped and sorted user interaction data can then merged with the user interaction data stored in the historical data store 119 (240). If a user identifier previously existed in the historical data store 119, the new user interaction data are added to the previous user interaction data. Otherwise, the new user interaction data is added with a new user identifier.

Column 330 illustrates the updated user interaction data for each of the user identifiers. Based upon the updated user interaction data, any conversions that occurred in each of the updated paths of user interactions can be determined (250). User interaction paths are constrained to those user interactions that are related to a particular advertiser 108. The conversion interactions of the particular advertiser 108 are used to determine if a conversion has occurred. As an example, assume that user interactions a₁₃ and a₃₂ represent conversion interactions. Accordingly, conversion paths 340 and 350 are found. Once found, the conversion paths can be written to another portion of the historical data store 119 or another data store for further analysis.

Each user interaction includes a set of data or dimensions associated with the user interaction. The dimensions can be sparsely populated, such that, any user interaction may have data relating to a subset of the dimensions. A large number of conversion paths can be generated based upon received user interaction data. Various reports regarding how a campaign or an advertiser's placements are performing can include various information regarding the conversion paths. Given the large potential number of conversion paths, various conversion paths can be grouped together to reduce the number of distinct conversion paths that are reported. In an illustrative embodiment, conversion paths that have the same number of user interactions and have corresponding data can be aggregated.

In one embodiment, users are able to create various groups to classify individual user interactions. A group includes a group definition that includes one or more group rules that determine if a particular user interaction belongs to a particular group. The group rules use the dimensional data of the user interaction to determine the group of a user interaction. Boolean operators such as AND, NOT, OR, etc. can be used to join various group rules in a group definition. Each group also includes a group name. In some embodiments, a group can include display information, such as, but not limited to, a text color and/or background color used to display the group name. Default groups may also be available to users. When default groups are available, a user can copy a default group, including the associated group rules, and then modify one or more of the group rules and/or the group name. User created groups can be stored in a data store, such as a local or remote database. The groups can then be accessed, modified, or deleted at a later time.

One or more groups can be associated with one another in a sorted or ordered list of grouping definitions. The groups within the ordered list are used to determine the group for each user interaction. The ordering of the list determines the priority of a particular group. A user interaction is grouped with the matching group that has the highest priority. A matching group of a lower priority will be ignored.

Using the ordered list of grouping definitions, each conversion path can be converted into a group path. A group path contains group elements that correspond to the user interactions of a conversion path. The group element can contain or reference data from the corresponding user interaction. In addition, the group element contains or references the group name and display information of the matching group.

In one embodiment, conversion paths are converted into group paths by adding a reference to the matching group to each of the user interactions. In another embodiment, group paths that are separate from the conversion paths are created. In this embodiment, the group paths can be stored in the same or in a different location from the location where the conversion paths are stored. Regardless of how the group paths are implemented, the group paths can be aggregated based upon the length of the group path and the group name of the group elements that make up the group path.

In one embodiment, the group paths contain various data from the corresponding conversion path. For example, a conversion path can include a monetary value associated with the conversion. As the group paths are aggregated, the value of all conversion paths associated with the aggregated group paths can also be aggregated. This aggregated value can be included in a report.

FIG. 4 is a block diagram that illustrates data associated with user interactions in accordance with an illustrative embodiment. A shell or form user interaction 400 illustrates four possible dimensions that can be associated with any user interaction. As illustrated, each user interaction can have a dimension related to a user interaction's source 402, medium 404, campaign 406, and keywords 408. The source dimension 402 indicates the source of a referral to a website. The medium dimension 404 provides further information regarding the source dimension 402. For instance, the source dimension 402 can indicate a particular search engine as the referring source, and the medium dimension 404 can further classify the source as being “cost-per-click” if the user clicked on a sponsored link or “organic” if the user clicked on an unpaid search results link. Other non-limiting examples of medium values include email, RSS feed, etc. The campaign dimension 406 allows an advertiser to differentiate between various advertising campaigns. For example, an advertiser may have two concurrently running advertising campaigns that refer users to one or more common page(s). The campaign dimension 406 allows the various user interactions related to the campaigns to be separated from one another. This gives an advertiser the ability to analyze the campaigns independently of one another, even though both campaigns drive users to the same common pages. The keyword dimension 408 contains any word or phrase the user used in a search. The available dimensions are not limited to these examples. For example, the dimensions relating to a user interaction can include, but are not limited to, an advertising network type, a date of the user interaction, a time of the user interaction, country/territory, landing page title, browser name, browser version, content, etc. For example, a user interaction can include a content dimension that allows an advertiser to indicate a version of the advertisement that the user clicked.

Conversion path 410 illustrates three user interactions 420, 430, and 440. User interaction 420 has dimensional data associated with the source dimension 422, the medium dimension 424, and the keywords dimension 428. The campaign dimension 426, however, has no associated data. In an illustrative embodiment, the user interaction 420 can be associated with a user searching using the keyword “coupon” in a search engine. The unpaid search results contain a link to the advertiser's website, which the user clicked upon. User interaction 430 provides an example of a user interaction whose dimensions are sparsely populated. Only the source dimension 432 has associated data. The remaining dimensions, 434, 436, and 438 do not have any associated dimensional data. User interaction 430 can correspond to, but is not limited to, a user visiting an advertiser's web page by typing the advertiser's web page address or URL directly into an address tool of a web browser. The third user interaction 440 in the conversion path 410 has all of the illustrated dimensions 442, 444, 446, and 448 populated with dimensional data. User interaction 440 can be associated with, but is not limited to, a user clicking on an advertising link that directs the user to the advertiser's web page. The advertising link is associated with the search results of a keyword search using “coupons” as the keyword in the search engine. Finally, the campaign dimension 446 indicates that the clicked advertising link corresponds to a “spring_sale” campaign.

Conversion path 450 illustrates another conversion path that includes three user interactions 460, 470, and 480. User interaction 460 has dimensional data associated with the source dimension 462, the medium dimension 464, and the campaign dimension 466. The keywords dimension 468 does not have any associated data. The medium dimension 464 indicates that a user was referred to an advertiser's web page based upon the referral_url.com web page based upon the source dimension 462. The keywords dimension 468 being empty can signify that a user did not do a keyword search associated with the referring site to navigate to the advertiser's web page. A user interaction 470 has dimensional data associated with the source dimension 472, the medium dimension 474, and the keywords dimension 478. The campaign dimension 476 has no associated data. In an illustrative embodiment, the user interaction 470 can correspond to a user visiting an advertiser's web page by clicking on an unpaid search result from a search engine using the keyword “sale.” User interaction 480 can be associated with a newsletter that contains a link to the advertiser's web page. The medium dimension 484 is set to “email” and indicates that the user navigated to the advertiser's web page from an email newsletter that corresponds to the “spring newsletter,” based upon the source dimension 482. User interaction 480 also is related to the “spring_sale” campaign as indicated by the campaign dimension 486.

Conversion paths 410 and 450 are two illustrative conversion paths. An advertiser is likely to have a significantly larger number of conversion paths. Individual conversion paths can also include fewer or more user interactions than those illustrated in FIG. 4, and fewer or more dimensions. Given the number of conversion paths for a particular advertiser, reporting on each individual conversion path can be overwhelming. To provide useful reporting metrics, the multitude of conversion paths can be aggregated together before a report is generated. Prior to such aggregation, however, the conversion paths can be filtered based upon various criteria. For instance, the conversion paths can be filtered by, but not limited to, conversion path length, time lag measurements, source dimension, medium dimension, campaign dimension, keywords dimension, any other user interaction dimension, etc. User interactions can also be grouped into various groups prior to the aggregation of the conversion paths. Aggregation can be based upon a user interaction's group instead of directly based upon dimensional data of the user interaction. Groups can include one or more group rules that are used to determine if a user interaction matches the group. Numerous groups can be defined and sorted lists of the grouping definitions can be created.

In one illustrative embodiment, groups can be associated with various channels. A channel generally relates to a source of a user interaction and can be determined based upon the dimensional data of a user interaction. Examples of channels include organic search results interactions, paid search results interactions, direct interactions, display interactions, email interactions, social network interactions, referral interactions, etc. The organic search results channel indicates that a user interaction originated from search results from a search engine. In contrast to the organic search results, user interactions are classified as part of the paid search results channel based upon a user interaction with a paid search result. Display interactions, email interactions, social network interactions, and referral interactions correspond with user interactions that originate from a displayed resource, email, social network site resource, or a referral site resource. A direct interaction corresponds to a URL that is typed directly into or accessed directly with a web browser. The channels or groups that define the channels can be used to classify each user interaction as being part of a particular channel.

FIG. 5 is a block diagram that illustrates conversion paths in accordance with an illustrative embodiment. FIG. 5 illustrates four conversion paths 502, 512, 524, and 534. Each conversion path includes user interactions that are associated with particular channels. For instance, conversion path 502 includes an organic search channel user interaction 504, a paid search channel user interaction 506, a direct interaction channel user interaction 508, and a conversion 510. Conversion paths 512, 524, and 532 contain user interactions that have also been associated with channels. In addition to being associated with a particular channel, each user interaction is also associated with a date related to the user interaction. The date can represent when a particular user interaction occurred. In one embodiment, this information is part of the dimensional data of each user interaction.

In an illustrative embodiment, the user interactions can be aggregated together based upon one or more channels and display data can be provided related to each channel. FIG. 6 is a flow diagram of a process for providing information related to one or more channels regarding aggregated user interactions in accordance with an illustrative embodiment. The process 600 can be implemented on the advertisement management system 110, the performance analysis apparatus 120, or another computing device. In one implementation, the process 600 is encoded on a computer-readable medium that contains instructions that when executed by a computing device cause the computing device to perform operations of process 600.

Information regarding a plurality of conversion paths is received (602). In one embodiment, this information can be retrieved from a data store, such as the historical data store 119. Selection of conversion paths can include filtering of unwanted conversion paths such as those that appear to be invalid or do not meet some initial search criteria, such as retrieving all conversion paths that have conversions in the past 30 days. In addition, conversion paths can be filtered based upon other criteria, such as filtering out all conversion paths except those that are related to a particular type of conversion. In yet another embodiment, only conversion paths that are complete can be selected. A complete conversion path is one that corresponds with a cookie that is at least as old as a lookback window. This allows shorter conversion paths, which can be associated with users who do not accept or regularly delete cookies, to be excluded. A list of channel groups is also received (604). The list of channel groups indicates the channels for which data should be determined. One or more channel groups can be stored in a data store. The one or more channel groups can be queried and received from the data store. In one embodiment, a subset of the channel groups can be retrieved from the data store. The subset of channel groups received can be based upon information from a user. For each of the channels, the user interactions are aggregated based upon the channel group of the user interaction and/or a position of the user interaction within the conversion path (606). In one embodiment, a channel associated with a user interaction is determined based upon group rules that are associated with each selected channel group. The position of the user interaction can relate to the date of the user interaction or the position of the user interaction within a conversion path. After the user interactions are aggregated, information regarding the aggregated information for each channel can be provided (608).

In one embodiment, a line graph for each channel can be generated and provided that illustrates a contribution of the channel over an entire conversion path. FIGS. 7A-7D illustrate building a graphical representation of user interactions based upon the user interactions' order in a conversion path for different channels in accordance with an illustrative embodiment. In FIGS. 7A-7D, the order relates to a user interaction's position in a conversion path relative to the conversion user interaction. In another embodiment, the order can refer to the date of the user interaction relative to the conversion user interaction. FIGS. 7A-7D relate to the data that is illustrated in FIG. 5. Specifically, FIG. 7A relates to the Organic Search channel, FIG. 7B relates to the Paid Search channel, FIG. 7C relates to the Direct Interaction channel, and FIG. 7D relates to the Display Interaction channel. In addition, FIGS. 7A-7D visualize the behavior of these channels in the conversion paths illustrated in FIG. 5.

The longest conversion path illustrated in FIG. 5 is conversion path 512. Conversion path 512 has four user interactions, not including the conversion user interaction 522. Accordingly, FIGS. 7A-7D include four possible points, with each point representing a position in a conversion path. Thus, the number of possible points in a graph can be set to the maximum conversion path length from a group of conversion paths. In another embodiment, only the last certain number of user interactions are included in the graphs. For example, only the last 3, 5, 10, 15, etc., user interactions can be included. Any user interactions beyond the last certain number of user interactions are truncated and not included in the graph. Benefits of this embodiment can include retrieving less conversion path data and allowing for better use of horizontal space. Better use of horizontal spacing can occur if there are a few relatively long conversion paths compared to a majority of shorter conversion paths. In an alternative embodiment, the horizontal axis can use a logarithmic scale. In one embodiment, the points along the horizontal axis, moving away from the conversion user interaction, can correspond to user interaction number 10, 100, 1000, etc. in the conversion paths. In another embodiment, the points along the horizontal axis can refer to time. For example, the points can correspond to 10, 100, 1000, etc. minutes, hours, or days.

Once the horizontal spacing is determined, the channel of each user interaction can be determined. Next, the position of the user interaction within the conversion path can be determined. The order of these operations is not determinative to providing aggregated information, and can be reversed. Once the channel and position of the user interaction are determined, the associated point in the associated channel graph can be determined. In one embodiment, each point is associated with one position in a conversion path. Accordingly, the position of the user interaction determines the point in the graph. In another embodiment, each point in the graph may be associated with one or more positions in the conversion path. For example, each point may be associated with two consecutive positions in a conversion path.

Once the point associated with a user interaction is determined, a value corresponding to the point can be updated. In one embodiment, the value is incremented by a factor. The factor can be, but is not limited to, 1, 2, 5, 10, etc. In one embodiment, the value has an associated minimum value. If after all of the user interactions have been properly assigned to the point, the value is below the minimum value, the value resorts to a default value, such as zero. The value can also have an associated maximum value, such that the value will not exceed the maximum value.

Once the values for each point for each channel have been determined, the graph data or other display data is generated. In one embodiment, a dot, such as 702, can be illustrated above the respective point. Dot 702 can represent one or more user interactions that are associated with point 704. In one embodiment, each dot represents a single user interaction that corresponds with the associated point. In another embodiment, dots above any particular point can be of logarithmic scale. The dots represent the amount of user interactions at any particular position within a conversion path. The height of the dots can be used to create a line graph. In one embodiment, the line graphs are curved line graphs. FIGS. 8A-8D illustrate fitting a curve to FIGS. 7A-7D in accordance with an illustrative embodiment.

FIGS. 7A-7D and 8A-8D illustrate line graphs based upon the length conversion paths. The horizontal spacing of the graphs can be based upon other data. FIGS. 9A-9D illustrate a graphical representation of user interactions for different channels in accordance with another illustrative embodiment. Similar to FIGS. 7A-7D and 8A-8D, FIGS. 9A-9D illustrate data relating to the conversion paths illustrated in FIG. 5. In FIGS. 9A-9D, the horizontal spacing is based upon the date of the user interactions instead of the length of conversion paths. In one embodiment, the horizontal spacing can correspond to a particular number of days. The particular number of days can be set to or based upon a lookback window. In the illustrated embodiment, the horizontal spacing corresponds to 30 days, with each point being a particular date. The days can be counted backward from the conversion. Conversion paths whose earliest user interaction is less than 30 days in the past would therefore not start at the first day but rather at the number of days from the conversion user interaction. If all the conversion paths are shorter than the number of days in a horizontal spacing, the horizontal spacing can automatically be adjusted. The adjustment can change the horizontal spacing from its initial setting to a horizontal spacing that corresponds with the maximum time lag to a conversion. Such an automatic change to the horizontal spacing effectively stretches the original data over the new horizontal spacing and provides for better use of the horizontal spacing.

In another embodiment, the horizontal spacing can be based upon any number of days. Once received, the number of days can be used to exclude user interactions older than the received number of days. The remaining user interactions can be plotted accordingly. The number of days can also be automatically set to include a certain percentage of conversion paths in their entirety. The number of days, therefore, would be set such that the certain percentage of all conversions occur within the determined number of days. As an illustrative example, the number of days can be calculated such that all of the user interactions of 50%, 75%, 85%, 90%, 95%, etc., of the received conversion paths are illustrated in the graphs.

Once the horizontal spacing is determined, the point that corresponds to each user interaction can be determined. After each user interaction is plotted, a curved line can be drawn to encompass all of the user interactions. In one embodiment, a curved line, such as 902, is drawn from point to point across the horizontal spacing. In another embodiment, a curve only connects consecutive points. FIG. 9E illustrates the same data as FIG. 9B, but the curve does not connect each of the points directly. Instead a curve 904 returns to the axis instead of connecting to point 906. Additionally, curve 908 is drawn based upon the data associated with point 906.

A graph can be generated for a number of channels. In one embodiment, a graph is generated for every known channel. In another embodiment, a selective list of one or more channels is received, and a graph for each of the received channels is generated. FIG. 10 illustrates a sample user interface to display graphs of various channels in accordance with an illustrative embodiment. The sample user interface includes a number of graphs 1002, 1004, 1006, 1008, 1010, and 1014 that are generated in accordance with an illustrative embodiment. As illustrated, the area under the graph line can be filled in with a color. In addition to the graphs, the user interface can include further information 1016 regarding the conversions associated with a particular channel. For instance, that total number of conversions and/or the percentage of conversions associated with each channel can be provided. Other information can also be provided in addition or in the alternative. For example, the average position of a user interaction, the average conversion path length, and/or the value associated with the conversions can be provided. The user interface can also include information 1018 regarding all conversions that are illustrated in the graphs.

The advertisement management system 110 and/or the performance analysis apparatus 120 can be realized by instructions that upon execution cause one or more processing devices to carry out the processes and functions described above. Such instructions can comprise, for example, interpreted instructions, such as script instructions, executable code, or other instructions stored in a computer-readable medium. The advertisement management system 110 and/or the performance analysis apparatus 120 can be distributively implemented over a network, such as a server farm, or can be implemented in a single computer device.

FIG. 11 illustrates a depiction of a computer system 1100 that can be used to provide user interaction reports, process log files, implement an illustrative performance analysis apparatus 120, or implement an illustrative advertisement management system 110. The computing system 1100 includes a bus 1105 or other communication component for communicating information and a processor 1110 coupled to the bus 1105 for processing information. The computing system 1100 also includes main memory 1115, such as a random access memory (RAM) or other dynamic storage device, coupled to the bus 1105 for storing information, and instructions to be executed by the processor 1110. Main memory 1115 can also be used for storing position information, temporary variables, or other intermediate information during execution of instructions by the processor 1110. The computing system 1100 may further include a read only memory (ROM) 1110 or other static storage device coupled to the bus 1105 for storing static information and instructions for the processor 1110. A storage device 1125, such as a solid state device, magnetic disk or optical disk, is coupled to the bus 1105 for persistently storing information and instructions.

The computing system 1100 may be coupled via the bus 1105 to a display 1135, such as a liquid crystal display, or active matrix display, for displaying information to a user. An input device 1130, such as a keyboard including alphanumeric and other keys, may be coupled to the bus 1105 for communicating information, and command selections to the processor 1110. In another embodiment, the input device 1130 has a touch screen display 1135. The input device 1130 can include a cursor control, such as a mouse, a trackball, or cursor direction keys, for communicating direction information and command selections to the processor 1110 and for controlling cursor movement on the display 1135.

According to various embodiments, the processes that effectuate illustrative embodiments that are described herein can be implemented by the computing system 1100 in response to the processor 1110 executing an arrangement of instructions contained in main memory 1115. Such instructions can be read into main memory 1115 from another computer-readable medium, such as the storage device 1125. Execution of the arrangement of instructions contained in main memory 1115 causes the computing system 1100 to perform the illustrative processes described herein. One or more processors in a multi-processing arrangement may also be employed to execute the instructions contained in main memory 1115. In alternative embodiments, hard-wired circuitry may be used in place of or in combination with software instructions to implement illustrative embodiments. Thus, embodiments are not limited to any specific combination of hardware circuitry and software.

Although an example processing system has been described in FIG. 11, implementations of the subject matter and the functional operations described in this specification can be implemented in other types of digital electronic circuitry, or in computer software, firmware, or hardware, including the structures disclosed in this specification and their structural equivalents, or in combinations of one or more of them.

Embodiments of the subject matter and the operations described in this specification can be implemented in digital electronic circuitry, or in computer software, firmware, or hardware, including the structures disclosed in this specification and their structural equivalents, or in combinations of one or more of them. Embodiments of the subject matter described in this specification can be implemented as one or more computer programs, i.e., one or more modules of computer program instructions, encoded on one or more computer storage medium for execution by, or to control the operation of, data processing apparatus. Alternatively or in addition, the program instructions can be encoded on an artificially-generated propagated signal, e.g., a machine-generated electrical, optical, or electromagnetic signal, that is generated to encode information for transmission to suitable receiver apparatus for execution by a data processing apparatus. A computer storage medium can be, or be included in, a computer-readable storage device, a computer-readable storage substrate, a random or serial access memory array or device, or a combination of one or more of them. Moreover, while a computer storage medium is not a propagated signal, a computer storage medium can be a source or destination of computer program instructions encoded in an artificially-generated propagated signal. The computer storage medium can also be, or be included in, one or more separate components or media (e.g., multiple CDs, disks, or other storage devices). Accordingly, the computer storage medium is both tangible and non-transitory.

The operations described in this specification can be implemented as operations performed by a data processing apparatus on data stored on one or more computer-readable storage devices or received from other sources.

The term “data processing apparatus” or “computing device” encompasses all kinds of apparatus, devices, and machines for processing data, including by way of example a programmable processor, a computer, a system on a chip, or multiple ones, or combinations, of the foregoing. The apparatus can include special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application-specific integrated circuit). The apparatus can also include, in addition to hardware, code that creates an execution environment for the computer program in question, e.g., code that constitutes processor firmware, a protocol stack, a database management system, an operating system, a cross-platform runtime environment, a virtual machine, or a combination of one or more of them. The apparatus and execution environment can realize various different computing model infrastructures, such as web services, distributed computing and grid computing infrastructures.

A computer program (also known as a program, software, software application, script, or code) can be written in any form of programming language, including compiled or interpreted languages, declarative or procedural languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, object, or other unit suitable for use in a computing environment. A computer program may, but need not, correspond to a file in a file system. A program can be stored in a portion of a file that holds other programs or data (e.g., one or more scripts stored in a markup language document), in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub-programs, or portions of code). A computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network.

The processes and logic flows described in this specification can be performed by one or more programmable processors executing one or more computer programs to perform actions by operating on input data and generating output. The processes and logic flows can also be performed by, and apparatus can also be implemented as, special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application-specific integrated circuit).

Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both. The essential elements of a computer are a processor for performing actions in accordance with instructions and one or more memory devices for storing instructions and data. Generally, a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto-optical disks, or optical disks. However, a computer need not have such devices. Moreover, a computer can be embedded in another device, e.g., a mobile telephone, a personal digital assistant (PDA), a mobile audio or video player, a game console, a Global Positioning System (GPS) receiver, or a portable storage device (e.g., a universal serial bus (USB) flash drive), to name just a few. Devices suitable for storing computer program instructions and data include all forms of non-volatile memory, media and memory devices, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks. The processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.

To provide for interaction with a user, embodiments of the subject matter described in this specification can be implemented on a computer having a display device, e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor, for displaying information to the user and a keyboard and a pointing device, e.g., a mouse or a trackball, by which the user can provide input to the computer. Other kinds of devices can be used to provide for interaction with a user as well; for example, feedback provided to the user can be any form of sensory feedback, e.g., visual feedback, auditory feedback, or tactile feedback; and input from the user can be received in any form, including acoustic, speech, or tactile input. In addition, a computer can interact with a user by sending documents to and receiving documents from a device that is used by the user; for example, by sending web pages to a web browser on a user's client device in response to requests received from the web browser.

Embodiments of the subject matter described in this specification can be implemented in a computing system that includes a back-end component, e.g., as a data server, or that includes a middleware component, e.g., an application server, or that includes a front-end component, e.g., a client computer having a graphical user interface or a Web browser through which a user can interact with an implementation of the subject matter described in this specification, or any combination of one or more such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication, e.g., a communication network. Examples of communication networks include a local area network (“LAN”) and a wide area network (“WAN”), an inter-network (e.g., the Internet), and peer-to-peer networks (e.g., ad hoc peer-to-peer networks).

The computing system can include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. In some embodiments, a server transmits data (e.g., an HTML page) to a client device (e.g., for purposes of displaying data to and receiving user input from a user interacting with the client device). Data generated at the client device (e.g., a result of the user interaction) can be received from the client device at the server.

While this specification contains many specific implementation details, these should not be construed as limitations on the scope of any inventions or of what may be claimed, but rather as descriptions of features specific to particular embodiments of particular inventions. Certain features that are described in this specification in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination. Moreover, although features may be described above as acting in certain combinations and even initially claimed as such, one or more features from a claimed combination can in some cases be excised from the combination, and the claimed combination may be directed to a subcombination or variation of a subcombination.

Similarly, while operations are depicted in the drawings in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in sequential order, or that all illustrated operations be performed, to achieve desirable results. In certain circumstances, multitasking and parallel processing may be advantageous. Moreover, the separation of various system components in the embodiments described above should not be understood as requiring such separation in all embodiments, and it should be understood that the described program components and systems can generally be integrated together in a single software product or packaged into multiple software products.

Thus, particular embodiments of the subject matter have been described. Other embodiments are within the scope of the following claims. In some cases, the actions recited in the claims can be performed in a different order and still achieve desirable results. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In certain implementations, multitasking and parallel processing may be advantageous. 

What is claimed is:
 1. A method of providing data related to conversion paths, the method comprising: receiving information regarding a plurality of conversion paths, wherein each conversion path comprises one or more user interactions each user interaction having a corresponding position in the conversion path, wherein the corresponding position of each user interaction in the conversion path is based on a number of user interactions from a conversion interaction or based upon a date of the conversion interaction, wherein a user interaction of the one or more user interactions comprises characteristics that are associated with the user interaction, wherein each conversion path corresponds to a user, and wherein each conversion path ends with the conversion interaction; receiving a list of channel groups, wherein each channel group comprises one or more group rules, and wherein each channel group comprises a group name; determining, using a processor, a channel group for each user interaction of each conversion path of the plurality of conversion paths based on the one or more group rules of each channel group of the list of channel groups and the characteristics associated with the user interaction, wherein the determined channel group has a group rule corresponding to a characteristic of the characteristics associated with the user interaction; aggregating, using the processor, into groups each user interaction of each conversion path of the plurality of conversion paths based on the determined channel group for each user interaction; receiving a selection of one or more aggregated groups of user interactions to visualize; and providing display data representative of one of a line chart, a bar chart, a histogram, or a pie chart based on the corresponding position in the conversion path of each user interaction of the selected one or more aggregated groups of user interactions.
 2. The method of claim 1, wherein the corresponding position of each user interaction within each conversion path is based on a number of interactions from the conversion interaction.
 3. The method of claim 2, further comprising filtering out, prior to determining a channel group for each user interaction, user interactions whose positions are greater than a value.
 4. The method of claim 3, wherein the value corresponds to a lookback window.
 5. The method of claim 4, wherein the value is less than the lookback window.
 6. The method of claim 3, wherein the value is determined such that a pre-determined percentage of user interactions are filtered out.
 7. The method of claim 1, wherein each of the plurality of conversion paths corresponds with a cookie that is at least as old as a lookback window.
 8. The method of claim 1, wherein the corresponding position of each user interaction within each conversion path is based upon a date of the conversion interaction.
 9. The method of claim 8, further comprising filtering out, prior to determining a channel group for each user interaction, user interactions whose date is before a value.
 10. The method of claim 9, further comprising filtering out, prior to determining a channel group for each user interaction, user interactions whose date is after a second value.
 11. The method of claim 9, wherein the value corresponds to a lookback window.
 12. The method of claim 11, wherein the value is less than the lookback window.
 13. The method of claim 9, wherein the value is determined such that a pre-determined percentage of user interactions are filtered out.
 14. A system comprising: a data processor; and a storage device storing instructions that, when executed by the data processor, cause the data processor to perform operations comprising: receiving information regarding a plurality of conversion paths, wherein each conversion path comprises one or more user interactions each user interaction having a corresponding position in the conversion path, wherein the corresponding position of each user interaction in the conversion path is based on a number of user interactions from a conversion interaction or based upon a date of the conversion interaction, wherein a user interaction of the one or more user interactions comprises characteristics that are associated with the user interaction, wherein each conversion path corresponds to a user, and wherein each conversion path ends with the conversion interaction; receiving a list of channel groups, wherein each channel group comprises one or more group rules, and wherein each channel group comprises a group name; determining a channel group for each user interaction of each conversion path of the plurality of conversion paths based on the one or more group rules of each channel group of the list of channel groups and the characteristics associated with the user interaction, wherein the determined channel group has a group rule corresponding to a characteristic of the characteristics associated with the user interaction; aggregating into groups each user interaction of each conversion path of the plurality of conversion paths based on the determined channel group for each user interaction; receiving a selection of one or more aggregated groups of user interactions to visualize; and providing display data representative of one of a line chart, a bar chart, a histogram, or a pie chart based on the corresponding position in the conversion path of each user interaction of the selected one or more aggregated groups of user interactions.
 15. The system of claim 14, wherein the corresponding position of each user interaction within each conversion path is based upon a number of interactions from the conversion interaction, wherein the operations further include filtering out, prior to determining a channel group for each user interaction, user interactions whose positions are greater than a value, and wherein the value is determined such that a pre-determined percentage of user interactions are filtered out.
 16. The system of claim 14, wherein the corresponding position of each user interaction within each conversion path is based on a date of the conversion interaction, wherein the operations further include filtering out, prior to determining a channel group for each user interaction, user interactions whose date is before a value.
 17. A non-transitory tangible computer-readable medium having instructions stored thereon, the instructions executed by a processor to perform: receiving information regarding a plurality of conversion paths, wherein each conversion path comprises one or more user interactions each user interaction having a corresponding position in the conversion path, wherein the corresponding position of each user interaction in the conversion path is based on a number of user interactions from a conversion interaction or based upon a date of the conversion interaction, wherein a user interaction of the one or more user interactions comprises characteristics that are associated with the user interaction, wherein each conversion path corresponds to a user, and wherein each conversion path ends with the conversion interaction; receiving a list of channel groups, wherein each channel group comprises one or more group rules, and wherein each channel group comprises a group name; determining a channel group for each user interaction of each conversion path of the plurality of conversion paths based on the one or more group rules of each channel group of the list of channel groups and the characteristics associated with the user interaction, wherein the determined channel group has a group rule corresponding to a characteristic of the characteristics associated with the user interaction; aggregating into groups each user interaction of each conversion path of the plurality of conversion paths based on the determined channel group for each user interaction; receiving a selection of one or more aggregated groups of user interactions to visualize; and providing display data representative of one of a line chart, a bar chart, a histogram, or a pie chart based on the corresponding position in the conversion path of each user interaction of the selected one or more aggregated groups of user interactions.
 18. The non-transitory tangible computer-readable medium of claim 17, the instructions executed by the processor further perform filtering out, prior to determining a channel group for each user interaction, user interactions whose corresponding positions are greater than a value, and wherein the value is determined such that a pre-determined percentage of user interactions are filtered out, wherein the corresponding position of each user interaction within each conversion path is based upon a number of interactions from the conversion interaction.
 19. The non-transitory tangible computer-readable medium of claim 17, the instructions executed by the processor further perform filtering out, prior to determining a channel group for each user interaction, user interactions whose date is before a value, wherein the corresponding position of each user interaction within each conversion path is based on a date of the conversion interaction. 